Overview
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 2930 |
| Missing cells | 2 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 3 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 375.0 KiB |
| Average record size in memory | 131.0 B |
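The percentage and per-record figures in this table follow from the raw counts. The sketch below recomputes them from the values shown above. It assumes the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes. The variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 10
n_observations = 2930
missing_cells = 2
duplicate_rows = 3
total_size_kib = 375.0  # already rounded in the report

# Assumption: the missing-cell percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}% (reported as < 0.1%)")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.0f} B")
```

Because the total size is itself rounded to one decimal place, the recomputed record size matches the reported 131.0 B only to the nearest byte.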
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 2 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 3 (0.1%) duplicate rows | Duplicates |
| Bedroom AbvGr is highly overall correlated with Gr Liv Area | High correlation |
| Garage Cars is highly overall correlated with Gr Liv Area and 3 other fields | High correlation |
| Gr Liv Area is highly overall correlated with Bedroom AbvGr and 3 other fields | High correlation |
| Overall Qual is highly overall correlated with Garage Cars and 3 other fields | High correlation |
| SalePrice is highly overall correlated with Garage Cars and 4 other fields | High correlation |
| Total Bsmt SF is highly overall correlated with SalePrice | High correlation |
| Year Built is highly overall correlated with Garage Cars and 2 other fields | High correlation |
| Garage Cars has 157 (5.4%) zeros | Zeros |
| Total Bsmt SF has 79 (2.7%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-05 15:26:55.465095 |
|---|---|
| Analysis finished | 2025-05-05 15:27:02.526347 |
| Duration | 7.06 seconds |
| Software version | ydata-profiling v4.16.1 |
Variables
Lot Area
Real number (ℝ)
| Distinct | 1960 |
|---|---|
| Distinct (%) | 66.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10147.922 |
| Minimum | 1300 |
|---|---|
| Maximum | 215245 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 1300 |
|---|---|
| 5-th percentile | 3188.3 |
| Q1 | 7440.25 |
| median | 9436.5 |
| Q3 | 11555.25 |
| 95-th percentile | 17131 |
| Maximum | 215245 |
| Range | 213945 |
| Interquartile range (IQR) | 4115 |
Descriptive statistics
| Standard deviation | 7880.0178 |
|---|---|
| Coefficient of variation (CV) | 0.77651542 |
| Kurtosis | 265.02367 |
| Mean | 10147.922 |
| Median Absolute Deviation (MAD) | 2040 |
| Skewness | 12.820898 |
| Sum | 29733411 |
| Variance | 62094680 |
| Monotonicity | Not monotonic |
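Several rows in the Lot Area tables are derived from others: the range from the minimum and maximum, the IQR from the quartiles, and the coefficient of variation, variance, and sum from the mean and standard deviation. The sketch below recomputes them from the reported figures. Inputs are rounded as shown in the report, so the results agree only up to rounding.

```python
# Figures copied from the Lot Area tables.
n_observations = 2930
minimum, maximum = 1300, 215245
q1, q3 = 7440.25, 11555.25
mean, std = 10147.922, 7880.0178

value_range = maximum - minimum   # reported: 213945
iqr = q3 - q1                     # reported: 4115
cv = std / mean                   # reported: 0.77651542
variance = std ** 2               # reported: 62094680
total = mean * n_observations     # reported: 29733411

print(f"range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.6f}")
print(f"variance: {variance:.0f}")
print(f"sum: {total:.0f}")
```

The kurtosis of 265.0 and skewness of 12.8 reflect the long right tail: the maximum of 215245 is more than 20 times the median of 9436.5.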
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 9600 | 44 | 1.5% |
| 7200 | 43 | 1.5% |
| 6000 | 34 | 1.2% |
| 9000 | 29 | 1.0% |
| 10800 | 25 | 0.9% |
| 8400 | 21 | 0.7% |
| 7500 | 21 | 0.7% |
| 6240 | 18 | 0.6% |
| 1680 | 18 | 0.6% |
| 6120 | 17 | 0.6% |
| Other values (1950) | 2660 | 90.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1300 | 1 | < 0.1% |
| 1470 | 1 | < 0.1% |
| 1476 | 1 | < 0.1% |
| 1477 | 2 | 0.1% |
| 1484 | 1 | < 0.1% |
| 1488 | 1 | < 0.1% |
| 1491 | 1 | < 0.1% |
| 1495 | 1 | < 0.1% |
| 1504 | 1 | < 0.1% |
| 1526 | 2 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 215245 | 1 | < 0.1% |
| 164660 | 1 | < 0.1% |
| 159000 | 1 | < 0.1% |
| 115149 | 1 | < 0.1% |
| 70761 | 1 | < 0.1% |
| 63887 | 1 | < 0.1% |
| 57200 | 1 | < 0.1% |
| 56600 | 1 | < 0.1% |
| 53504 | 1 | < 0.1% |
| 53227 | 1 | < 0.1% |
Overall Qual
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.0948805 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4110261 |
|---|---|
| Coefficient of variation (CV) | 0.23151005 |
| Kurtosis | 0.05241245 |
| Mean | 6.0948805 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.19063396 |
| Sum | 17858 |
| Variance | 1.9909946 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 825 | 28.2% |
| 6 | 732 | 25.0% |
| 7 | 602 | 20.5% |
| 8 | 350 | 11.9% |
| 4 | 226 | 7.7% |
| 9 | 107 | 3.7% |
| 3 | 40 | 1.4% |
| 10 | 31 | 1.1% |
| 2 | 13 | 0.4% |
| 1 | 4 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4 | 0.1% |
| 2 | 13 | 0.4% |
| 3 | 40 | 1.4% |
| 4 | 226 | 7.7% |
| 5 | 825 | 28.2% |
| 6 | 732 | 25.0% |
| 7 | 602 | 20.5% |
| 8 | 350 | 11.9% |
| 9 | 107 | 3.7% |
| 10 | 31 | 1.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 31 | 1.1% |
| 9 | 107 | 3.7% |
| 8 | 350 | 11.9% |
| 7 | 602 | 20.5% |
| 6 | 732 | 25.0% |
| 5 | 825 | 28.2% |
| 4 | 226 | 7.7% |
| 3 | 40 | 1.4% |
| 2 | 13 | 0.4% |
| 1 | 4 | 0.1% |
Year Built
Real number (ℝ)
High correlation 
| Distinct | 118 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.3563 |
| Minimum | 1872 |
|---|---|
| Maximum | 2010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 1872 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1954 |
| median | 1973 |
| Q3 | 2001 |
| 95-th percentile | 2007 |
| Maximum | 2010 |
| Range | 138 |
| Interquartile range (IQR) | 47 |
Descriptive statistics
| Standard deviation | 30.245361 |
|---|---|
| Coefficient of variation (CV) | 0.015342412 |
| Kurtosis | -0.50171504 |
| Mean | 1971.3563 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | -0.60446222 |
| Sum | 5776074 |
| Variance | 914.78184 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2005 | 142 | 4.8% |
| 2006 | 138 | 4.7% |
| 2007 | 109 | 3.7% |
| 2004 | 99 | 3.4% |
| 2003 | 88 | 3.0% |
| 1977 | 57 | 1.9% |
| 1920 | 57 | 1.9% |
| 1976 | 54 | 1.8% |
| 1999 | 52 | 1.8% |
| 2008 | 49 | 1.7% |
| Other values (108) | 2085 | 71.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1872 | 1 | < 0.1% |
| 1875 | 1 | < 0.1% |
| 1879 | 1 | < 0.1% |
| 1880 | 5 | 0.2% |
| 1882 | 1 | < 0.1% |
| 1885 | 2 | 0.1% |
| 1890 | 7 | 0.2% |
| 1892 | 2 | 0.1% |
| 1893 | 1 | < 0.1% |
| 1895 | 3 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2010 | 3 | 0.1% |
| 2009 | 25 | 0.9% |
| 2008 | 49 | 1.7% |
| 2007 | 109 | 3.7% |
| 2006 | 138 | 4.7% |
| 2005 | 142 | 4.8% |
| 2004 | 99 | 3.4% |
| 2003 | 88 | 3.0% |
| 2002 | 47 | 1.6% |
| 2001 | 35 | 1.2% |
Gr Liv Area
Real number (ℝ)
High correlation 
| Distinct | 1292 |
|---|---|
| Distinct (%) | 44.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1499.6904 |
| Minimum | 334 |
|---|---|
| Maximum | 5642 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 334 |
|---|---|
| 5-th percentile | 861 |
| Q1 | 1126 |
| median | 1442 |
| Q3 | 1742.75 |
| 95-th percentile | 2463.1 |
| Maximum | 5642 |
| Range | 5308 |
| Interquartile range (IQR) | 616.75 |
Descriptive statistics
| Standard deviation | 505.50889 |
|---|---|
| Coefficient of variation (CV) | 0.33707549 |
| Kurtosis | 4.1378382 |
| Mean | 1499.6904 |
| Median Absolute Deviation (MAD) | 311 |
| Skewness | 1.2741097 |
| Sum | 4394093 |
| Variance | 255539.24 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 864 | 41 | 1.4% |
| 1092 | 26 | 0.9% |
| 1040 | 25 | 0.9% |
| 1456 | 20 | 0.7% |
| 1200 | 18 | 0.6% |
| 894 | 15 | 0.5% |
| 912 | 14 | 0.5% |
| 816 | 14 | 0.5% |
| 1728 | 13 | 0.4% |
| 848 | 13 | 0.4% |
| Other values (1282) | 2731 | 93.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 334 | 1 | < 0.1% |
| 407 | 1 | < 0.1% |
| 438 | 1 | < 0.1% |
| 480 | 1 | < 0.1% |
| 492 | 1 | < 0.1% |
| 498 | 1 | < 0.1% |
| 520 | 1 | < 0.1% |
| 540 | 1 | < 0.1% |
| 572 | 1 | < 0.1% |
| 599 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5642 | 1 | < 0.1% |
| 5095 | 1 | < 0.1% |
| 4676 | 1 | < 0.1% |
| 4476 | 1 | < 0.1% |
| 4316 | 1 | < 0.1% |
| 3820 | 1 | < 0.1% |
| 3672 | 1 | < 0.1% |
| 3627 | 1 | < 0.1% |
| 3608 | 1 | < 0.1% |
| 3500 | 1 | < 0.1% |
Garage Cars
Real number (ℝ)
High correlation, Zeros
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7668146 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 157 |
| Zeros (%) | 5.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.76056636 |
|---|---|
| Coefficient of variation (CV) | 0.43047321 |
| Kurtosis | 0.24496945 |
| Mean | 1.7668146 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.21983636 |
| Sum | 5175 |
| Variance | 0.5784612 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1603 | 54.7% |
| 1 | 778 | 26.6% |
| 3 | 374 | 12.8% |
| 0 | 157 | 5.4% |
| 4 | 16 | 0.5% |
| 5 | 1 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 157 | 5.4% |
| 1 | 778 | 26.6% |
| 2 | 1603 | 54.7% |
| 3 | 374 | 12.8% |
| 4 | 16 | 0.5% |
| 5 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 1 | < 0.1% |
| 4 | 16 | 0.5% |
| 3 | 374 | 12.8% |
| 2 | 1603 | 54.7% |
| 1 | 778 | 26.6% |
| 0 | 157 | 5.4% |
Total Bsmt SF
Real number (ℝ)
High correlation, Zeros
| Distinct | 1058 |
|---|---|
| Distinct (%) | 36.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1051.6145 |
| Minimum | 0 |
|---|---|
| Maximum | 6110 |
| Zeros | 79 |
| Zeros (%) | 2.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 453 |
| Q1 | 793 |
| median | 990 |
| Q3 | 1302 |
| 95-th percentile | 1776 |
| Maximum | 6110 |
| Range | 6110 |
| Interquartile range (IQR) | 509 |
Descriptive statistics
| Standard deviation | 440.61507 |
|---|---|
| Coefficient of variation (CV) | 0.41898913 |
| Kurtosis | 9.1356123 |
| Mean | 1051.6145 |
| Median Absolute Deviation (MAD) | 236 |
| Skewness | 1.1562043 |
| Sum | 3080179 |
| Variance | 194141.64 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 79 | 2.7% |
| 864 | 74 | 2.5% |
| 672 | 29 | 1.0% |
| 912 | 26 | 0.9% |
| 1040 | 25 | 0.9% |
| 768 | 24 | 0.8% |
| 816 | 23 | 0.8% |
| 728 | 21 | 0.7% |
| 384 | 19 | 0.6% |
| 1008 | 19 | 0.6% |
| Other values (1048) | 2590 | 88.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 79 | 2.7% |
| 105 | 1 | < 0.1% |
| 160 | 1 | < 0.1% |
| 173 | 1 | < 0.1% |
| 190 | 1 | < 0.1% |
| 192 | 1 | < 0.1% |
| 216 | 2 | 0.1% |
| 240 | 1 | < 0.1% |
| 245 | 1 | < 0.1% |
| 264 | 4 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6110 | 1 | < 0.1% |
| 5095 | 1 | < 0.1% |
| 3206 | 1 | < 0.1% |
| 3200 | 1 | < 0.1% |
| 3138 | 1 | < 0.1% |
| 3094 | 1 | < 0.1% |
| 2846 | 1 | < 0.1% |
| 2660 | 1 | < 0.1% |
| 2633 | 1 | < 0.1% |
| 2630 | 1 | < 0.1% |
Full Bath
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 166.1 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1532 | 52.3% |
| 1 | 1318 | 45.0% |
| 3 | 64 | 2.2% |
| 0 | 12 | 0.4% |
| 4 | 4 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1532 | 52.3% |
| 1 | 1318 | 45.0% |
| 3 | 64 | 2.2% |
| 0 | 12 | 0.4% |
| 4 | 4 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2930 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2930 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 2930 | 100.0% |
Bedroom AbvGr
Real number (ℝ)
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8542662 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 8 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.82773114 |
|---|---|
| Coefficient of variation (CV) | 0.28999788 |
| Kurtosis | 1.8914207 |
| Mean | 2.8542662 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.30569421 |
| Sum | 8363 |
| Variance | 0.68513884 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 1597 | 54.5% |
| 2 | 743 | 25.4% |
| 4 | 400 | 13.7% |
| 1 | 112 | 3.8% |
| 5 | 48 | 1.6% |
| 6 | 21 | 0.7% |
| 0 | 8 | 0.3% |
| 8 | 1 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 8 | 0.3% |
| 1 | 112 | 3.8% |
| 2 | 743 | 25.4% |
| 3 | 1597 | 54.5% |
| 4 | 400 | 13.7% |
| 5 | 48 | 1.6% |
| 6 | 21 | 0.7% |
| 8 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 1 | < 0.1% |
| 6 | 21 | 0.7% |
| 5 | 48 | 1.6% |
| 4 | 400 | 13.7% |
| 3 | 1597 | 54.5% |
| 2 | 743 | 25.4% |
| 1 | 112 | 3.8% |
| 0 | 8 | 0.3% |
Kitchen Qual
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 168.9 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TA |
|---|---|
| 2nd row | TA |
| 3rd row | Gd |
| 4th row | Ex |
| 5th row | TA |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| TA | 1494 | 51.0% |
| Gd | 1160 | 39.6% |
| Ex | 205 | 7.0% |
| Fa | 70 | 2.4% |
| Po | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| T | 1494 | 25.5% |
| A | 1494 | 25.5% |
| G | 1160 | 19.8% |
| d | 1160 | 19.8% |
| E | 205 | 3.5% |
| x | 205 | 3.5% |
| F | 70 | 1.2% |
| a | 70 | 1.2% |
| P | 1 | < 0.1% |
| o | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5860 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5860 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 5860 | 100.0% |
SalePrice
Real number (ℝ)
High correlation 
| Distinct | 1032 |
|---|---|
| Distinct (%) | 35.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180796.06 |
| Minimum | 12789 |
|---|---|
| Maximum | 755000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 23.0 KiB |
Quantile statistics
| Minimum | 12789 |
|---|---|
| 5-th percentile | 87500 |
| Q1 | 129500 |
| median | 160000 |
| Q3 | 213500 |
| 95-th percentile | 335000 |
| Maximum | 755000 |
| Range | 742211 |
| Interquartile range (IQR) | 84000 |
Descriptive statistics
| Standard deviation | 79886.692 |
|---|---|
| Coefficient of variation (CV) | 0.4418608 |
| Kurtosis | 5.1189 |
| Mean | 180796.06 |
| Median Absolute Deviation (MAD) | 37000 |
| Skewness | 1.7435001 |
| Sum | 5.2973246 × 10^8 |
| Variance | 6.3818836 × 10^9 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 135000 | 34 | 1.2% |
| 140000 | 33 | 1.1% |
| 130000 | 29 | 1.0% |
| 155000 | 28 | 1.0% |
| 145000 | 26 | 0.9% |
| 160000 | 23 | 0.8% |
| 110000 | 21 | 0.7% |
| 185000 | 21 | 0.7% |
| 127000 | 20 | 0.7% |
| 120000 | 20 | 0.7% |
| Other values (1022) | 2675 | 91.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 12789 | 1 | < 0.1% |
| 13100 | 1 | < 0.1% |
| 34900 | 1 | < 0.1% |
| 35000 | 1 | < 0.1% |
| 35311 | 1 | < 0.1% |
| 37900 | 1 | < 0.1% |
| 39300 | 1 | < 0.1% |
| 40000 | 1 | < 0.1% |
| 44000 | 1 | < 0.1% |
| 45000 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 755000 | 1 | < 0.1% |
| 745000 | 1 | < 0.1% |
| 625000 | 1 | < 0.1% |
| 615000 | 1 | < 0.1% |
| 611657 | 1 | < 0.1% |
| 610000 | 1 | < 0.1% |
| 591587 | 1 | < 0.1% |
| 584500 | 1 | < 0.1% |
| 582933 | 1 | < 0.1% |
| 556581 | 1 | < 0.1% |
Interactions
Correlations
| | Bedroom AbvGr | Full Bath | Garage Cars | Gr Liv Area | Kitchen Qual | Lot Area | Overall Qual | SalePrice | Total Bsmt SF | Year Built |
|---|---|---|---|---|---|---|---|---|---|---|
| Bedroom AbvGr | 1.000 | 0.435 | 0.122 | 0.526 | 0.093 | 0.299 | 0.078 | 0.197 | 0.056 | -0.032 |
| Full Bath | 0.435 | 1.000 | 0.340 | 0.385 | 0.233 | 0.059 | 0.306 | 0.352 | 0.198 | 0.304 |
| Garage Cars | 0.122 | 0.340 | 1.000 | 0.524 | 0.305 | 0.344 | 0.611 | 0.702 | 0.450 | 0.601 |
| Gr Liv Area | 0.526 | 0.385 | 0.524 | 1.000 | 0.227 | 0.418 | 0.578 | 0.723 | 0.379 | 0.317 |
| Kitchen Qual | 0.093 | 0.233 | 0.305 | 0.227 | 1.000 | 0.000 | 0.473 | 0.423 | 0.251 | 0.343 |
| Lot Area | 0.299 | 0.059 | 0.344 | 0.418 | 0.000 | 1.000 | 0.197 | 0.429 | 0.353 | 0.121 |
| Overall Qual | 0.078 | 0.306 | 0.611 | 0.578 | 0.473 | 0.197 | 1.000 | 0.809 | 0.473 | 0.665 |
| SalePrice | 0.197 | 0.352 | 0.702 | 0.723 | 0.423 | 0.429 | 0.809 | 1.000 | 0.606 | 0.681 |
| Total Bsmt SF | 0.056 | 0.198 | 0.450 | 0.379 | 0.251 | 0.353 | 0.473 | 0.606 | 1.000 | 0.442 |
| Year Built | -0.032 | 0.304 | 0.601 | 0.317 | 0.343 | 0.121 | 0.665 | 0.681 | 0.442 | 1.000 |
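The "High correlation" alerts in the overview can be cross-checked against this matrix. The sketch below lists, for each variable, the other variables whose absolute coefficient reaches a cutoff. The cutoff of 0.5 is an assumption: the report does not state it, but it reproduces the alert list exactly (for example, Total Bsmt SF pairs only with SalePrice at 0.606, while its 0.473 with Overall Qual is not flagged). The function name `high_correlations` is illustrative, not part of any library.

```python
# Correlation matrix transcribed from the table (same row and column order).
names = [
    "Bedroom AbvGr", "Full Bath", "Garage Cars", "Gr Liv Area", "Kitchen Qual",
    "Lot Area", "Overall Qual", "SalePrice", "Total Bsmt SF", "Year Built",
]
matrix = [
    [1.000, 0.435, 0.122, 0.526, 0.093, 0.299, 0.078, 0.197, 0.056, -0.032],
    [0.435, 1.000, 0.340, 0.385, 0.233, 0.059, 0.306, 0.352, 0.198, 0.304],
    [0.122, 0.340, 1.000, 0.524, 0.305, 0.344, 0.611, 0.702, 0.450, 0.601],
    [0.526, 0.385, 0.524, 1.000, 0.227, 0.418, 0.578, 0.723, 0.379, 0.317],
    [0.093, 0.233, 0.305, 0.227, 1.000, 0.000, 0.473, 0.423, 0.251, 0.343],
    [0.299, 0.059, 0.344, 0.418, 0.000, 1.000, 0.197, 0.429, 0.353, 0.121],
    [0.078, 0.306, 0.611, 0.578, 0.473, 0.197, 1.000, 0.809, 0.473, 0.665],
    [0.197, 0.352, 0.702, 0.723, 0.423, 0.429, 0.809, 1.000, 0.606, 0.681],
    [0.056, 0.198, 0.450, 0.379, 0.251, 0.353, 0.473, 0.606, 1.000, 0.442],
    [-0.032, 0.304, 0.601, 0.317, 0.343, 0.121, 0.665, 0.681, 0.442, 1.000],
]

THRESHOLD = 0.5  # assumption: inferred from the alerts, not stated in the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |r| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        result[name] = [
            other
            for j, other in enumerate(names)
            if i != j and abs(matrix[i][j]) >= threshold
        ]
    return result


high = high_correlations(names, matrix, THRESHOLD)
for name, partners in high.items():
    if partners:
        print(f"{name}: {', '.join(partners)}")
```

Under this cutoff, Full Bath, Kitchen Qual, and Lot Area have no partners, which matches their absence from the alert list.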
Missing values
Sample
| | Lot Area | Overall Qual | Year Built | Gr Liv Area | Garage Cars | Total Bsmt SF | Full Bath | Bedroom AbvGr | Kitchen Qual | SalePrice |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 31770 | 6 | 1960 | 1656 | 2.0 | 1080.0 | 1 | 3 | TA | 215000 |
| 1 | 11622 | 5 | 1961 | 896 | 1.0 | 882.0 | 1 | 2 | TA | 105000 |
| 2 | 14267 | 6 | 1958 | 1329 | 1.0 | 1329.0 | 1 | 3 | Gd | 172000 |
| 3 | 11160 | 7 | 1968 | 2110 | 2.0 | 2110.0 | 2 | 3 | Ex | 244000 |
| 4 | 13830 | 5 | 1997 | 1629 | 2.0 | 928.0 | 2 | 3 | TA | 189900 |
| 5 | 9978 | 6 | 1998 | 1604 | 2.0 | 926.0 | 2 | 3 | Gd | 195500 |
| 6 | 4920 | 8 | 2001 | 1338 | 2.0 | 1338.0 | 2 | 2 | Gd | 213500 |
| 7 | 5005 | 8 | 1992 | 1280 | 2.0 | 1280.0 | 2 | 2 | Gd | 191500 |
| 8 | 5389 | 8 | 1995 | 1616 | 2.0 | 1595.0 | 2 | 2 | Gd | 236500 |
| 9 | 7500 | 7 | 1999 | 1804 | 2.0 | 994.0 | 2 | 3 | Gd | 189000 |
| | Lot Area | Overall Qual | Year Built | Gr Liv Area | Garage Cars | Total Bsmt SF | Full Bath | Bedroom AbvGr | Kitchen Qual | SalePrice |
|---|---|---|---|---|---|---|---|---|---|---|
| 2920 | 1894 | 4 | 1970 | 1092 | 1.0 | 546.0 | 1 | 3 | TA | 71000 |
| 2921 | 12640 | 6 | 1976 | 1728 | 2.0 | 1728.0 | 2 | 4 | TA | 150900 |
| 2922 | 9297 | 5 | 1976 | 1728 | 2.0 | 1728.0 | 2 | 4 | TA | 188000 |
| 2923 | 17400 | 5 | 1977 | 1126 | 2.0 | 1126.0 | 2 | 3 | TA | 160000 |
| 2924 | 20000 | 5 | 1960 | 1224 | 2.0 | 1224.0 | 1 | 4 | TA | 131000 |
| 2925 | 7937 | 6 | 1984 | 1003 | 2.0 | 1003.0 | 1 | 3 | TA | 142500 |
| 2926 | 8885 | 5 | 1983 | 902 | 2.0 | 864.0 | 1 | 2 | TA | 131000 |
| 2927 | 10441 | 5 | 1992 | 970 | 0.0 | 912.0 | 1 | 3 | TA | 132000 |
| 2928 | 10010 | 5 | 1974 | 1389 | 2.0 | 1389.0 | 1 | 2 | TA | 170000 |
| 2929 | 9627 | 7 | 1993 | 2000 | 3.0 | 996.0 | 2 | 3 | TA | 188000 |
Duplicate rows
Most frequently occurring
| | Lot Area | Overall Qual | Year Built | Gr Liv Area | Garage Cars | Total Bsmt SF | Full Bath | Bedroom AbvGr | Kitchen Qual | SalePrice | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4590 | 8 | 2006 | 1554 | 2.0 | 1554.0 | 2 | 2 | Gd | 209500 | 2 |
| 1 | 7018 | 5 | 1979 | 1535 | 2.0 | 0.0 | 2 | 4 | TA | 118858 | 2 |
| 2 | 10800 | 5 | 1987 | 1200 | 0.0 | 1200.0 | 3 | 3 | TA | 179000 | 2 |